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Abstract 



In this paper we study two related iterative randomized algorithms for distributed computation 
of averages. The first one is the recently proposed Broadcast Gossip Algorithm, in which at each 
r~^ , iteration one randomly selected node broadcasts its own state to its neighbors. The second algorithm 

is a novel de-synchronized version of the previous one, in which at each iteration every node is 
allowed to broadcast, with a given probability: hence this algorithm is affected by interference among 
messages. Both algorithms are proved to converge, and their performance is evaluated in terms of rate 
of convergence and asymptotical error: focusing on the behavior for large networks, we highlight the 
(-H \ role of topology and design parameters on the performance. Namely, we show that on fully-connected 

graphs the rate is bounded away from one, whereas the asymptotical error is bounded away from 
zero. On the contrary, on a wide class of locally-connected graphs, the rate goes to one and the 
asymptotical error goes to zero, as the size of the network grows larger. 



sj^ , 1 Introduction 

Q^ , When it comes to perform control and monitoring tasks through networked systems, a crucial role has to 

04 ■ be played by algorithms for distributed estimation, that is algorithms to collectively compute aggregate 

information from locally available data. Among these problems, a prototypical one is the distributed 
\f^ • computation of averages, also known as the average consensus problem. In the average consensus problem 

^D I each node of a network is given a real number, and the goal is for the nodes to iteratively converge to 

>— ^ ■ a good estimate of the average of these initial values, by repeatedly communicating and updating their 

states. 

Recently, an increasing interest has been devoted among the control and signal processing communi- 
ties to randomized algorithms able to solve the average consensus problem. This is motivated because 
randomized algorithms may offer better performance or robustness with respect to their deterministic 
C^ I counterparts. As well, randomized algorithms may require less or no synchronization among the nodes, a 

property which is often difficult to guarantee in the applications. Moreover, it may very well happen that 
the communication network itself be random, thus implying the need for a stochastic analysis. These 
facts are especially true when communication is obtained through a wireless network. For these reasons, 
the present paper will study the performance of a pair of notable randomized algorithms, in terms of their 
ability to approach average consensus. Among randomized algorithms, researchers have devised gossip 
algorithms, in which at each iteration only a random subset of the nodes performs communication and 
update. Among these algorithms, the Broadcast Gossip Algorithm has been recently proposed: at each 
time step one node, randomly selected from a uniform distribution over the nodes, broadcasts its current 
value to its neighbors. Each of its neighbors, in turn, updates its value to a convex combination of its 
previous value and the received one. This algorithm may seem to require a significant synchronization. 
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since the choice of the broadcasting node has to be done at the global level. However, it has been observed 
that this communication model is equivalent, up to a suitable scaling of time, to assume that each node 
broadcasts at time instants selected by a private Poisson process. Nevertheless, this equivalence is no 
longer true if broadcasting takes a finite duration of time. When this happens a node is, with non-zero 
probability, the target of more than one simultaneous communication and destructive collision may occur. 
This is especially true in wireless communications, which have to share their communication medium. 
Hence, the practical applicability of this algorithm in a distributed system resides cither on the possibility 
to incorporate some nontrivial collision detection scheme, or on the assumption that communications are 
instantaneous. The first contribution of this paper is to relax this assumption to allow a communication 
model in which more than one node can broadcast at the same time, possibly implying the interference 
of attempted communications. Thus, we introduce a novel distributed randomized algorithm for average 
computation, facing the issue of interference in communication. 

As a second contribution, we study both the original broadcast algorithm and the novel one, in terms 
of their asymptotical estimation error. Note, indeed, that the iterations of broadcast algorithms do not 
preserve the average of states, and in general do not converge to the initial average. Hence, it is crucial 
for the application to estimate such bias. In this paper, we prove that on sparse graphs with bounded 
degree, both algorithms are asymptotically unbiased, in the sense that the asymptotical errors go to zero 
as the network grows larger. Instead, on complete graphs, both algorithms are asymptotically biased. 
Moreover, for both algorithms we investigate significant trade-offs between speed of convergence and 
asymptotical performance. Our results are obtained via a mean square analysis, under the assumption 
that the communication graph possesses some structural symmetries, namely, that it is the Cayley graph 
of an Abelian group. 

Related works 

In latest years, many papers have dealt with distributed estimation and synchronization in networks. The 
latter problem, which partly motivates the interest for our algorithms, has been studied in several papers, 
also using approaches based on consensus, as in [25l[7]. Namely, an increasing interest has been devoted 
to randomized averaging algorithms. An influential pairwise gossip communication model is introduced 
in [23] and [6]. Moreover, the interest for wireless networks has induced several authors to consider gossip 
models based on broadcast, rather than on pairwise communication |17i [3] . Other gossip models have 
been developed, for instance, in [51 [T^ [^ [TU] . 

Broadcast and wireless communication inherently imply the issue of interference among simultane- 
ous communications. This has been considered since early times: actually, our communication model 
can be easily related with the slotted ALOHA protocol illustrated in [T]. More recently, the negative 
effects of interference on the connectivity of wireless networks have been discussed in various papers, 
for instance [13l [M]. The role of interference and message collisions in consensus problems, and the 
effectiveness of countermeasures, has been already investigated in the computer science community [9]: 
in the latter paper, consensus has to be achieved on a variable belonging to a finite set. Several related 
papers about real-valued average consensus problems have also appeared: we shall briefly review some 
of them. A few paper are concerned with design of cominunication protocols dealing with message colli- 
sions: for instance, |24| presents a data driven architecture which grants channel access to nodes based 
on their local data values. In j22| . a consensus algorithm allowing simultaneous quantized transmissions 
has been proposed: the issue of collisions is avoided by a suitable data-dependent coding scheme. In [27j . 
local additive interference is considered, and under some technical assumptions a collaborative consen- 
sus algorithm is proposed, which allows simultaneous transmissions and ensures energy savings. Besides 
interference, related robustness results against data losses in distributed systems have been presented, 
for instance, in [30 ( I18 j . Finally, the paper [2] proposes a related communication model, in which the 
broadcasted values are received or not with a probability which depends on the transmitter and receiver 
nodes, rather than on the activity of their neighbors. 

Our results build on the general mean square analysis developed for randomized consensus algorithms 
in |16 1 I17 j . and on results in algebra, linear algebra and probability theory. Indeed, our main results 



will assume that the network topology exhibits specific symmetries, in the sense that it is represented by 
the Cayley graph of an Abelian group. Cayley graphs have a long history in abstract mathematics [4], 
and they have been recently used in control theoretical applications, for instance in |291[2T|. to describe 
communication networks. Assuming Abelian Cayley topologies is motivated by their algebraic structure, 
which allows a formal mathematical treatment, as well as by the potential applications. Indeed, Abelian 
Cayley graphs are a simplified and idealized version of communications scenarios of practical interest. In 
particular, they capture the effects on performance of the strong constraint that, for many networks of 
interest, communication is local, not only in the sense of a little number of neighbors, but also with a bound 
on the geometric distance among connected agents. This is especially true for wireless networks: indeed, 
Abelian Cayley graphs have been related, for instance in [SIHHIIH]: to other models for wireless networks, 
as random geometric graphs or disk-graphs [20]. Moreover, it has to be noted that several topologies 
which appear in the applications are themselves Abelian Cayley: for instance, complete graphs, rings, 
toroidal grids, hypercubes. Finally, our main findings about the asymptotical error are based on a result 
in |15| about the limit of the invariant vectors of sequences of stochastic matrices: related results dealing 
with comparison of Markov chains have appeared, for instance, in |191 lllj . 

Paper structure 

After presenting the averaging problem and the algorithms under consideration in Section [51 we develop 
our analysis tools in Section [3] Later, Sections [4| and [5] are devoted to analyze the original broadcast 
gossip algorithm and the novel one, respectively. Some concluding remarks are presented in Section [6] 

Notations and preliminaries 

Given a set V of finite cardinality |V| = A^, we define a graph on this set as Q ~ (V, £), where £ CV xV 
(we exclude the presence of self-loops, namely edges of type {u,u)). Given u,v G V, if {v,u) G £, we 
shall say that v is an in-neighbor of m, and conversely u is an out-neighbor of v. We will denote by J\f^ 
and A/"", the set of, respectively, the out-neighbors and the in-neighbors of u. Also, d+ = \Af^\ and 
d:^ = lA/"^ I, are said to be the out-degree and the in-degree of node u, respectively. A graph whose nodes 
all have in-degree k is said to be fc-regular. A graph is said to be (strongly) connected if for any pair 
of nodes (u,v), one can find a path, that is an ordered list of edges, from u to v. A graph is said to 
be symmetric, if (w, v) £ £ implies (u, u) G £. In a symmetric graph, being the neighborhood relation 
symmetrical, there is no distinction between in- and out-neighbors and we will drop, consequently, the 
index -I- and — . We let 1 be the A^— vector whose entries are all 1, / be the N x N identity matrix, and 
il := / — A^^^ll*. Given a A^-vector a, we denote by diag(a) the diagonal matrix whose diagonal is equal 
to a. The adjacency matrix of the graph Q, denoted by Ag, is the matrix in{0,l}^^^ such that Ag^.^ = 1 
if and only if {v, u) G £. We also define the out-degree matrix as Di := diag (Agl), the in-degree matrix 
as Dg := dia.g{Agl), and the Laplacian matrix as Lg = Dg — Ag. The subscript Q will be usually 
skipped for the ease of notation. Given a matrix M g M^^^, we define the graph Qm = {V,£m) by 
putting (u, w) G £m iS V y^ w and My^y ^^ 0. A matrix M is said to be adapted to the graph Q = (V, £) 
if Gm C g, that is if £m ^ £. 

When it comes to compare two sequences {a„}„gN and {6n}neNj we shall write that a„ = o{bn) if 
limsup„ T^ = 0, that a„ = 0{b„) if limsup„ j^ < -foo, and that a„ = 6(&n) if there exist n G N and 
positive scalars ci,C2 such that ci6„ < an < C2&ji, for n > h. 

Given a linear operator L from a vector space to itself, for instance represented by a square matrix, we 
denote by sr(L) its spectral radius, that is the modulus of its largest in magnitude eigenvalue. Whenever 
sr(L) = 1, we shall define as csr(L) the modulus of the second largest eigenvalue in magnitude. 



2 Broadcast gossip averaging algorithms 

In this section we present the averaging problem and the algorithms we are dealing with. Let us be 
given a graph Q = (V, £) and a vector of real values x £ R^, assigned to the nodes. Then, the averaging 
problem consists in approximating the average -^ X^ugv *'^i" '^ith the constraint that at each time step 
each node v can communicate its current state to its out-neighbors only. The simplest solution to this 
problem consists in an iterative algorithm such that x(0) = x, and for all t G Z>o, x{t + 1) = Px{t), 
where P G M^^^ is a doubly stochastic matrix adapted to Q. Provided the diagonal of P is non-zero, and 
the graph Q is connected, the algorithm solves the averaging problem, in the sense that for every m S V, 

lim Xuit) =Xavc(i), 
t— >-foo 

where by definition Xavc(t) = ji J2vev ^v{t). 

However, it is clear that this algorithm potentially requires synchronous communication along all the 
edges of the graph. As this requirement may be difficult to meet in real applications, in this paper we 
shall study algorithms which require little or no synchronization among the agents. We assume from now 
on that the communication network be represented by a graph, denoted by Q = (V,f ), whose adjacency 
and Laplacian matrix will be denoted by A and L, respectively. 

We start recalling the Broadcast Gossip Algorithm [T71 [3] . In this algorithm, at each time step one 
node, randomly selected from a uniform distribution over the nodes, broadcasts its current value to its 
neighbors. Its neighbors, in turn, update their values to a convex combination of their previous values 
and the received ones. More formally, we can write the algorithm as follows. Note that the only design 
parameter is the weight given to the received value in the convex update. 

Broadcast Gossip Algorithm - Parameters: q e (0, 1) 
For alliGZ>o, 
1: Sample a node v from a uniform distribution over V 



for u eV do 

if u e TV^ then 

Xu{t + 1) = (1 - q)xu{t) + qxy{t) 
else 

Xu{t + 1) = Xu{t) 

end if 
end for 



This algorithm can also be written in the form of iterated matrix multiplication. Let v be the 
broadcasting node which has been sampled at time t. Then, x{t -I- 1) = P(t)x{t), where 

Pit) = I + q Y. {euel - euel), (1) 

and e,; is the i-th clement of the canonical basis of M^. Clearly, at each time t, the matrix P{t) is 
the realization of a uniformly distributed random variable, depending on the stochastic choice of the 
broadcasting node. 

The Broadcast Gossip algorithm has received a considerable, and has been extensively studied in [3] : 
in that paper, under the assumption that the communication graph is symmetric, the algorithm is shown 
to converge, and its speed of convergence is estimated. Instead, in this paper we shall concentrate on 
another crucial analysis question. Since the algorithm does not preserve the average of states Xavo through 
iterations, how far from the initial average the convergence value will be? 

As we noted in the introduction, the practical interest of the BGA algorithm depends on the assump- 
tion that the transmissions arc instantaneous, and reliable. In an effort towards more realistic commu- 
nication models, we propose a modification of the Broadcast Gossip Algorithm, which has the feature of 
dealing with the issue of finite-length transmissions, and consequent packet losses due to collisions. 



At each time step, each node wakes up, independently with probabihty p, and broadcasts its current 
state to all its out-neighbors. It is clear that some agents can be the target of more than one message: 
in this case, we assume that a destructive collision occurs, and no message is actually received by these 
agents. Moreover, interference prevents the broadcasting nodes from hearing any others (half-duplex 
constrainio) . If an agent u S V is able to receive a message from agent v, it updates its state to a convex 
combination with the received value, similarly to the standard BGA. 

More formally, the algorithm is as follows. 

Collision Broadcast Gossip Algorithm - Parameters: q £ (0, 1), p E (0, 1) 

For aU teZ>o, 
1; let Act be the random set defined by: for every u G V, P[v G Act] ~ p 
2: let Rec := {w G V : [A^^: n Act | = 1, u ^ Act} 
3: for all u G Rec, let a{u) be the only v £ V such that v G Act n A/'^^ 
4: for M G V do 
5: if u G Rec then 

6: Xuit + 1) = {1 - q)xu{t) + qx„(^u)(.t) 

7: else 

8: Xu{t + 1) = Xu{t) 

9: end if 
10: end for 

Also the latter algorithm can be written as matrix multiplication, defining 

P{t) = I + q Y. {euel-euel). (2) 

{v.u)e Act X Roc 

Both algorithms can actually be rewritten in the following graph-theoretic way. Let Q{t) be the 
subgraph of Q depicting the communications taking place at a certain instant t: the pair (u, v) is an edge 
in Q{t) iff V successfully receives a message from u at time t. Denote by A{t), D{t), Lit) the adjacency, 
degree and Laplacian matrices, respectively, of Q{t). Clearly, for both algorithms: 

P{t) = I - qL{t) (3) 

Several questions are natural for the collision-prone CBGA algorithm, in comparison with its syn- 
chronous collision-less counterpart. Does the algorithm converge? How fast? Does it preserve the average 
of states? If not, how far it goes? Is performance poorer because of interferences? 

We are going to answer the analysis questions we have posed, via a mean square analysis of the 
algorithm. Our interest will be mostly devoted to the properties of algorithms for large networks. To 
this goal, we shall often assume to have a sequence of graphs Qm of increasing order A^ G N, and we shall 
consider, for each A^ G N, the corresponding matrix P{t), which depends on Q and then on A^. Thus we 
will focus on studying the asymptotical properties of the algorithms as A^ goes to infinity. 

3 Mathematical models and techniques 

In this section we lay down some mathematical tools that can be used to analyze gossip and other 
randonfized algorithms. Namely, in Subsection 13.11 we review the mean square analysis in |17j . which 
is going to be applied to the BGA and CBGA algorithms in Sections |4] and [SJ respectively. Later, in 
Subsection l3.2l we introduce Abelian Cayley graphs and their properties, and in Subsection l3.3l we present 
perturbation results about sequences of stochastic matrices and their invariant vectors. 

^The half-duplex constraint is assumed throughout the paper: however, dropping it would imply minimal changes in the 
analysis. 



3.1 Mean square analysis 

Motivated by the interpretation of the broadcast algorithms as iterated multiphcations by random ma- 
trices, given in Equations ([1]) and ©, in this subsection we shall recall from [l7] some definitions and 
results for the analysis of randomized schemes, in which the vector of states x{t) S K^ evolves in time 
following an iterate x{t + l) = P{t)x{t), where {P{t)}t^z>o i^ ^ sequence of i.i.d. stochastic-matrix-valued 
random variables. Consequently, x{t) is a stochastic process. 

The sequence P{t) is said to achieve probabilistic consensus if for any a;(0) 6 R^, it exists a scalar 
random variable a such that almost surely lim(_j.oo x{t) = al. The following result, proved in |17j . is a 
simple and effective tool to prove the convergence of these randomized linear algorithms. Let P := E[P(i)]. 



Proposition 3.1 (Probabilistic consensus criterion) If P{t) is such that the graph Qp is strongly 
connected and, for all v G V, almost surely P{t)yy > 0, then P{t) achieves probabilistic consensus. 



For the rest of this section, we shall assume that the assumptions of Proposition 13.11 are satisfied. Then, 
in order to describe the speed of convergence of the algorithm, let us define d{t) := N^^\\x{t) ~ Xavc{t)'i-\\'^ , 
and the rate of convergence as 

i?:=suplimsupE[d(t)]i/*. (4) 

a:(0) t->+oo 

Let us consider the (linear) operator L : R^^^ — > R^^^ such that 

L(A/) =E[P{tyMP{t)]. 

Notice that E[d{t)] = E[x*{t)nx{t)], and that E[x* {t)nx{t)] = x*{0)A{t)x{0), with A(i) = L*(rj). Let TZ 
be the reachable space of the pair (L, fl). Then, 

■R = sr(L|K), 

and it has been proved in |17j that the rate can be estimated in terms of eigenvalues oi N x N matrices, 
as 

esr(P)2 <R< sr(L(0)). (5) 

It is clear that, if all P(t) matrices are doubly stochastic, x{t) converges to the initial average of states 
a;ave(0). If they are not, as in the cases we are studying in this paper, it is worth asking how far is the 
convergence value from the initial average. To study this bias in the estimation of the average, we let 
j3{t) = |a;avo(i) — a;avc(0)P, and we define a matrix B such that 

lim E[/3(i)] =.T(0)*Ba;(0). (6) 

Let Q{t) = P{t — 1) . . . P(0), so that x{t) = Q{t)x{0). The convergence of the algorithm is equivalent 
to the existence of a random variable p, taking values in R^, such that limf_j.oo Q{t) = Ip*. This implies 
that 

B = E[pp*] - 2N-'^E[p]V + N-^lV, 

where E[p] and E[p/9*] are the eigenvectors relative to 1 of P and L, respectively. In particular, if P is 
doubly stochastic, then 

B^E[pp*]-N-^ll*, (7) 

For any A'^-dimensional matrix A, E[pp*] = jrxT linrt-s-oo L*(A) : as a consequence, in the latter case 
the matrix B can be computed as 

B = N-^ \imL*{ll*)-N-^ll*. (8) 



Instead of computing B, it may be easier and significant to obtain results about some functional of B, for 
instance the spectral norm ||i3||2 or the trace tr(i?). The latter figure is of interest because, if we assume 
that the initial values Xi{0) are i.i.d. random variables with zero mean and variance cr^, then 

E[x{OyBx{0)]^a^tT{B). (9) 

Motivated by our interest for the properties of the algorithms on large networks, and by Equation JO]), 
we state the following definition. 

Definition 3.1 Given a sequence of graphs Qn, a randomized algorithm P{t) is said to be asymptotically 
unbiased if lim tvlB). 

3.2 Abelian Cayley graphs 

A special family of graphs is that of Abelian Cayley graphs, which are graphs representing a group, as 
follows. Let G be an Abelian group, considered with the additive notation, and let S* be a subset of G. 
Then, the Abelian Cayley graph generated by 5* in G is the graph Q{G^ S) having G as node set and 
£ = {{g,h) G G X G : h ~ g £ S} as edge set. Note that the graph G{G, S) is symmetric if and only if 
S is inverse-closed, and is connected if and only if S generates the group G. As well, a notion of Abelian 
Cayley matrix can be defined. Given a group G and a generating vector tt of length \G\, we shall define 
the Cayley matrix generated by n as cayl(7r);ig = iTh-g- Correspondingly, for a given Cayley matrix M, 
we shall denote by tt^^ the generating vector of the Cayley matrix M. Clearly, the adjacency matrices of 
G-Cayley graphs are G-Cayley matrices. Abelian Cayley graphs and matrices enjoy important properties: 
we refer the reader to [U [31] for more details. 

Example 3.2 Abelian Cayley graphs encompass several important examples. 

(i) The complete graph on N nodes, that is the graph where each node is directly connected with every 
other node, is Q{Zn,Zn \ {0}); 

(ii) The circulant graphs (resp. matrices) are Abelian Cayley graphs (resp. matrices) on the group Zat; 
we shall denote the circulant matrix generated by tt as circ(7r). For instance, he ring graph is the 
circulant graph Q{'L]^ ^ {^l? 1}); its adjacency matrix is ^ = circ([0, 1, 0, . . . , 0, 1]) and its Laplacian 
is i = circ([2, — 1, 0, . . . , 0, —1]). For a ring, the eigenvalues of L are {2(1 — cos (^^))}/eZjv i and in 
particular Ai = ^^ + o (■^) as A^ — > +oo. 

(iii) The square grids on a d-dimensional torus are 5(Z^,{ei,— ei},;g{i^ ,j}), where e^ are elements of 
the canonical basis of M''. In particular, the n-dimensional hypercube graph is Q{1i2, {ei}i£{i,...,n})- 

Notice how all examples above are naturally forming a sequence of graphs indicizcd by the number of 
nodes N . Special cases for which wc will be able to prove asymptotically unbiascdncss in the following, 
is when the generating set S is finite and "kept fixed" as in the ring graph. Precisely we consider the 
following general example 

Example 3.3 Start from an infinite lattice V = 1/^ and fix a finite S C Z'^\{0} generating lA as a group. 
For every integer n, let Vn = [— n, uY considered as the Abelian group 'Z^.n+i ^^d let C/(") be the Cayley 
Abelian graph generated hy Sn = S r\ [—n^nY. Notice that all graphs ^'"^ have the same generating set 
S for n sufficiently large, in particular they have the same degree. Moreover, by the assumption made 
on S*, all of them arc strongly connected. Rings and grids fit in this framework. 

A G-Cayley structure for the communication graph Q has deep consequences on the mean square 
analysis of randomized consensus algorithms. In particular, it is easy to see that if G is a G-Cayley 
matrix, then also L(G) is G-Caylcy. To exploit this property, let us define a sequence of matrices by 



the following recursion. Let A(0) = 7V~^11* and A{t + 1) = L(A(t)). Since 11* is a Cayley matrix 
on any Abelian group, the above fact implies that for every t e Z>o, the matrices A(i) are G-Cayley. 
Thus, the sequence A(t) can be equivalently seen as the sequence of the corresponding generating vectors 
■""(*) = 6o^(*)- We shall refer to this vector as the MSA vector. Since L is linear, the MSA vector 
evolution can be written as a matrix multiplication 7r(t + 1) = M'!T{t). Clearly, 

R = esr(Af). (10) 

Moreover, M is ^-stochastic, and if we let tt' be the right invariant vector of M, that is 




then, B can be computed using the fact that stochastic Cayley matrices are also doubly stochastic and 
Equation ([8]), obtaining 

B=— cayl(7r' 1|, trB = 7r'-— . (11) 

The following simple result will be useful later on. 

Lemma 3.4 Let Q he G-Cayley, and suppose that P{t)uu > 5 > Q almost surely. Then, 

Qa ^ ^A/* ^ Qa+A'+A'A 

Proof: A straightforward computation shows that 

Af„„ = ^E[P(t)fc+,,„P(i)fco] 

k 

Hence, M„„ > implies that there exists k such that P{t)k+v,u > and P{t)ko- If fc = 0, this yields 
Avu > 0. li k + v = u, then Au-v,o > or also A^v > 0. Finally, if both cases above do not happen, then, 
Ak+v,u > and Ak+v,v > which yields {A*A)uv > 0. The second inclusion is thus proven. To prove 
the first one, notice that, by the assumption made, M„„ > SK[P{t)vu]- This completes the proof. D 

Notice that in the EGA and CBGA examples we can always apply Lemma [3.41 with S = 1 — q. 

3.3 Local perturbation of stochastic matrices 

In this section we recall a perturbation result presented in |15] and which will be used later to estimate 
the trace of the matrix B and to prove asymptotic unbiasedness for sequences of Cayley graphs. 

We assume we have fixed an infinite universe set V, an increasing sequence Vn of finite cardinality 
subsets of V such that U„Ki = V and a sequence of irreducible stochastic matrices P*^"^ on the state 
spaces Vn with the following stabilizing property; for every i G V, there exist n{i) G N such that i G Ki(i) 
and 

p(n) ^ pini.)) ^ Vn > n(z) , V.7 G y„(.) (12) 

This property allows us to define, in a natural way, a limit stochastic matrix on V. For every i,j G V, we 
define 

,(oo) ^ f 4"^*» if j G K(.) 



'^ 1 otherwise ^ ^ 



The sequence of stochastic matrices p(") is said to be weakly democratic if the corresponding invariant 
vectors tt^"^ are such that, for all « G V, tt^ — >■ for n -^ +00. Fix now a finite subset W C n„Ki and 
another sequence of irreducible stochastic matrices P^"^ on Vn such that 



P^=P^ VzGK\W-,VjGK 

(14) 



In other terms, P*^"-* can be seen as a perturbed version of P^"-* with the perturbation confined to the 
fixed subset W and stable (it does not change as n increases). Also for this perturbed sequence we can 
define, following ([T3| . the asymptotic chain p(°°). The following result has been proven in |15) . 

Theorem 3.5 Suppose that p(°°) and P^°°> are both irreducible. Then, if P^"^' is weakly democratic, 
also p(") is weakly democratic. 

In the sequel we will use this result to prove the convergence to of the trace in (fTTj) . 

4 Broadcast without collisions 

In this section we present a comprehensive analysis of the Broadcasting Gossip Algorithm, in terms of 
both rate of convergence and bias. The following result characterizes the convergence properties of the 
algorithm, extending [3l Lemma 2 and 4] to directed networks. 

Proposition 4.1 (Convergence of BGA algorithm) Consider BGA algorithm. Let Q be any con- 
nected graph, L its Laplacian matrix, and Ai the smallest positive eigenvalue of L. Then 

P ^ I -qN-^L (15) 

L{n) = SI - g(l - q)N-^{L + L*) + qN-'^{L*ll* + 11* L) - q^N-^{D+ - A){D+ - A*). (16) 

In particular, BGA algorithm achieves probabilistic consensus. 
Proof: First, we compute P, as 

Then, by Proposition l3.1[ the algorithm achieves probabilistic consensus. To compute L{il) = E[P{t)*ilP{t)], 
we notice that e* Cj = Sij , and we compute 

E[p(t)*p(t)] =^ E ( ^ + 9 E (^«<' - ^«<) ] ( ^ + <? E (^-<' - ^-o 

=^- ;^9(i + -^*) + 9^;^E E {evsl ~ e^el - eycl + eue*J 

=I-j^q{L + L*) + q^j^(L + L*); 
and 

E[P(t)*irP(t)]-l^ M+9 E (e«<-e«e:) irll + q Y. i^^^ ~ ^^O 

= 11* - —q{L* 11* + 11* L) + q'^—Y^ E (^i-e* - e^e^, - e„e; + e„ej;) 



'"6V u,weAf+ 



--ll*-^q{L*ll* + ll*L) 



+'^^]v E('^^)^^^'^^~E'^^ E e^''3^~E^^ E "^^e^ + E E ^""^^ 

IV - —q (L*ll* + 11*L) + g2 J_((£)+)2 _ £,+^* _ ^23+ + A^*). 



n 

The next corollary provides explicit bounds on the convergence rate, assuming the communication 
graph to be symmetric. 



Corollary 4.2 Under the assumptions of Proposition \4^ if G is .symmetric, then 

P^I -qN-^L L{fl) ^n-2q{l~q)N-^L-q^N-^L^. (17) 

In particular, the convergence rate can be estimated as 

Proof: Specializing Proposition 231 and combining ([T71) with ([S]), we get 



2g (l-g) , g' ,2 ^ . 2g(l - g) 

N ^ 7V2 1 - N 



D 

Once we have understood that the algorithm converges, and how fast, it is worth to discuss to which 

value it converges. It is clear that the average is not preserved through each iteration, although the 

average can sometimes be preserved in expectation. Indeed, E [a;avc(i + l)|a^(i)] ~ N~^l*Px{t), so that 

if Q is symmetric, 

E[Xavc(0]=a:avc(0), Vt > 0. 

However, this weak preservation property docs not imply that the expected estimation error E[/3(t)] be 
zero, neither in finite time nor as time goes to infinity. In facts, the following example, derived from |16j . 
shows that limt E[/3{t)] can be positive and, furthermore, it can be bounded away from zero, uniformly 
in the network size N. 

Example 4.3 (Complete graph) Let ^ be a complete graph, and consider the BGA algorithm. In 
this case, the operator L can be computed explicitly, giving 

R={i-qr 

2-qN^ N ' 

Namely, tri? = ij^ (l — -i-) and hence the BGA is not asymptotically unbiased on the complete graph. 
Note that by changing the parameter g, one can trade off speed and estimation bias: this is numerically 
investigated in Figure [TJ Similar trade-offs will be considered throughout the paper. 

Next, we focus on Abelian Cayley graphs. 

Lemma 4.4 Consider the BGA algorithm and let the communication graph Q be Abelian Gayley with 
degree d. Then, the MSA vector -k evolves as 

7r(i + l) = (C + r)7r(t), (19) 

where the matrix 

C = /-2^(L + L*) 

is Cayley and T is a matrix such that T = — T where T does not depend neither on q nor explicitly on 

N but only on S, and is such that the number of non-zero rows is at most d"^ and non-zero columns is at 
most d"^ — d + 1. 
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Figure 1: Trade-off in the BGA between tr(_B) and R as functions of q, for a complete graph with N = 30. 



Proof: Using the fact that all matrices are Abelian Cayley, we obtain 

A(t + l)=E[P(t)*A(i)P(i)] 

=A(t) - qE[L{t)* A{t)] - qE[A{t)L{t)] + q^E[L{t)* A{t)L{t)] 

[l-±iL + L*))Ait)+q'^J2 E E (.e,el-e,el)Ait)ie,e;-e,el) 



geG h:h~geS k:k-geS 

Since 7r(t) = A{t)e^ (being A(i) symmetric for every t), we easily obtain from above that 

Trit + l)={I-q-{L + L*))7T{t) + -q^Yl E E ^/.-fe Wlege^ - e^e^ - e,,e; + e,,e^)eo 

geG h-.h-geS k:k~g£S 



h:heS k:keS 



ge-S h:h-geS 



From this we immediately see that the non-zero elements of T have row indices in (S — S)US and column 
indices in S — S. Hence the result follows. 

D 
Note that, in general, C will not be a stochastic matrix since it may be negative on the diagonal, however 
for large enough N (and d fixed) it is surely a stochastic matrix. Note, moreover, that the entries of the 
matrix T in p9p are proportional to N^^ and, moreover, the number of the non-zero entries is upper 
bounded by (1 -t- cP)'^, which does not depend on N. Hence, we expect that, as N diverges, T would 
become negligible, and the MSA would depend on the matrix C only. This would imply the unbiasedness 
of the algorithm, because it is immediate to remark that the invariant vector of C is N~^l. This fact can 
be actually stated as the following result. 

Theorem 4.5 (Unbiasedness of BGA) Fix a finite S C Z'^\{0} generating IJ^ as a group. For every 
integer n, let Vn = [—n,n] considered as the Abelian group Zjn+i and let Q^"' be the Cayley Abelian 
graph generated by Sn = S H [—n,n]'^. On the sequence of Q^^' the BGA is asymptotically unbiased. 



Proof: The idea is to apply the perturbation result Theorem 13.51 to the sequence of matrices C* 
and (C + T)*. Notice that Gc = 6?^"^ while G^c+t)- 2 0^"^ by Lemma [3H Hence, Gc- and G{c+Tr 
are both strongly connected. This also implies that the limit graph on Z^ of the two sequences Gc* and 
G{c+T)' both contain (y(°°) which is simply the Abelian Cayley graph on Zf^ generated by S which is 
strongly connected by the assumption made. Finally, notice that C* is Abelian Cayley, hence obviously 
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weakly democratic, while Lemma |4]4] guarantees that {C + T)* is a finite perturbation of C* in the sense 
of Section [331 Hence also {C + T)* is weakly democratic. This yields, by (|11|) . 



tiB= \N-^ -TT^I < N- 



D 



The results about Cayley graphs can be made more specific in the following example. 

Example 4.6 (Ring graph) Note that in this case the Cayley graph is circulant. For the ring graph, 
Proposition 14. 1 1 implies that, for N large enough, 

87r2 ^ ^ ^^ 87r2 

and namely R = 1 — Q{^). Specializing the proof of Lemma 14.41 the evolution of 7r(t) can be written 
as 



1 



7:,{t + l) 
that is 7r(t + 1 

and 






Aq 



7T,{t) 



N N ^ 



i^J-l{t)+T^l + lit)) + ^ (2^oW +^2(t) +^-2W)<5oj, 



N 



(C + r)7r(t), with 



C = circ(l-^,2|-,0,...,0,2|-) 



N' iV' 



N' 



( 4 



N 



-2 





-2 








1 































1 0\ 









\ -2 



1 



0-20/ 



Thanks to these explicit formulas, we can numerically compute the rate and bias. Namely, about the rate 
we obtain that esr(C) < esr(C + T), and esr(C + T) = 1 — 0(A^~^). This means that the perturbation 
T does not significantly affect the rate for large N . Moreover, since esr(C) = 1 — 0(A^^^) and esr(C + 
T) - esr(C) = 1 - QiN'^) we argue that actually 



-'-^-(i^ 



as t 



On the other hand, about the bias we obtain that tr (B) = Q(N~^\ We can also numerically evaluate 
B and R as functions of q: these results are shown in Figure [H Note that choosing q we can trade off 
asymptotical error and convergence rate. D 



Remark 4.1 (Pareto optimality) Combining Examples 14.31 and 14.61 we obtain that tri? is propor- 
tional to ^~\-^ (l — -i-) on a complete graph, and to 7V'^(1 — i?) on a ring graph, considering the approx- 
imation for large N . In Figure [3] we plot tr_B against i?, for both graphs, up to a suitable scaling with 
respect to N . The fact that both curves are monotone decreasing highlights the fact that every value of 
q is Pareto optimal, in the sense that one can not improve one of the two objectives without making the 
other worse. 
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Figure 2: Trade-off in the BGA between tr (&) and R for a ring graph with N = 30. 
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Figure 3: tr_B as a function of i? on complete and ring graphs. 



Example 4.7 (Random geometric graph) In order to take into account the locality constraint on 
connectivity in real-world networks, several models of random geometric graphs have been proposed, as 
accounted in |20| . In this example we consider sequences of random geometric graphs, based on the 
following construction. For all A^ G N, we sample A^ points {zijigZjv from a uniform distribution over 

a unit square [0, 1]^, and we draw an edge (i, j) between nodes i, j G Zjv when \z 



-^,ll<0.8^i2^. 

On these realizations we run the BGA algorithm until convergence is reached, up to a small tolerance 
threshold, and in this way we compute an approximation of limf_>.oo /3(0- The results are plotted in 
Figure |4l in comparison with the analogous quantity on the complete and ring graph. It appears from 
simulations that, as TV diverges, /3 is 0(1) on the complete graph, whereas it is 0(Af^^) on the ring graph, 
and 0(A^^^/^) on the random geometric graph. These evidences are in accordance with the theoretical 
results, and suggest their extension to other families of geometric graphs. 



5 Broadcast with collisions 



In this section we present a comprehensive analysis of the Collision Broadcast Gossip Algorithm, in terms 
of both rate of convergence and bias. After proving a general convergence result, we focus on complete 
graphs in Subsection 15.11 on Abelian Cayley graphs in Subsection 15.21 and finally on ring graphs in 
Subscction l5.31 Our main finding is that the performance of the CBGA, in terms of both speed and bias, 
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Figure 4: The plot shows the asymptotical bias /3 as a function of N, computed by simulations on 
sequences of complete, random geometric, and ring graphs. Plotted values are the average over 1000 
runs. See text for more information. 

is close to the BGA: in this sense we may claim the robustness of broadcast gossip algorithms to local 
interferences. 

Proposition 5.1 (Convergence of CBGA algorithm) Consider the CBGA algorithm. Let Q be any 
connected graph, and L its Laplacian matrix. Then, 



P = I - qp{l - p)""' L, 



(20) 



where ((1— p) )ij = (1— p) '^ for every i and j . In particular, the CBGA algorithm achieves probabilistic 
consensus. 

Proof: The probability of having at time t a successful transmission from w to w is ¥[Auvit) = 1] = 
p(l -p)''" A„„. Hence, 

EiA{t))=p{l-pf'A. (21) 

Now, 

P = E[P{t)] = E[I + q{A{t) - D{t))] = / + qp{l - p)^' [A - D-]. 

Note that Qp is strongly connected: by Proposition [O] we can conclude the convergence of the CBGA. 

D 

Remark: Formula (|20p is simpler if the graph is d-regular, because in that case 

P = I -qp{l-pfL. 



(22) 

Denote by that Ai is the smallest nonzero eigenvalue of L. Then esr(P) = 1 — qp{l — pY'Xi, and this 
together with (O and [1 - qp{l -p)'^Xi]'^ > 1 - 2qp{l - p^Xi leads to 



R> l-2qp{l-p)'^\i 



(23) 



as a lower bound for the rate of convergence. As a function of p, this lower bound is minimal whenever 
P[Aij(i) = 1] is maximal, that is for p equal to p* = ^j^j. 

Hence, natural questions are: is this bound tight? is p* the best choice to improve the convergence 
rate? The content of the next section will answer positively these questions for complete and ring graphs. 
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5.1 Complete graphs 

A thorough analysis can be carried on for the complete graph. 

Proposition 5.2 (Complete graph) Let x{t) evolve following the CBGA algorithm, and let R and B 

he defined by (|4]) and ([TJ, respectively. Then, 



R = l-q{2- q)Np{l - p)^-^ 

2-qN^ N ' 

Namely, tr(_B) = j^ (l — jj) and then the CBGA is not asymptotically unbiased on the complete graph. 

Proof: It is immediate that in the complete graph either one node communicates to every others, or 
no node communicates. In the latter case, P{t) = I. The former case, which has probability Np(l—p)'^~^, 
leads to P{t) = Ph, where P(, = I + q'^u^yieu^*, ~ CmC*) and v is the realization of a random variable 
uniformly distributed over the nodes. Consequently, the analysis is quite close to the one of Example [ 
We note that 

and that 

E[P{t)*P(t)] =Np{l-p)^-^E[P;Pb] + (1 - Np{l-p)^-^)I 

-*-.)-'[.. 2,111^ . ,^ (^. - l,xr -.,. ^. 

+ (l - Npil - pf-^)I 
=Np{l - pf-'[I - 2q{l - q){I - HI)] + (1 - Np{l - p)^-')I 

= [1 - 2q{l - q)Np(l - p)^-l]/ + 2q(l - q)Np{l - P)"^"'^- 

and 

¥.[p{tyirp{t)] =Np{i-pf-^[ir + ^{iN- ifi - 2{n - i)(ir - /) 

+ (iV - 2)(11* - /) + (TV - 1)/)] + (1 - Np{l -pf-^)ll* 
=q^N^p{l - pf-^I + (1 - q^Np{l - p)^-^)ir. 

This means that the application of L keeps invariant the subspaces generated by / and 11*, and the 
linear operator L can be represented by the matrix 

l-2q{l~q)Np{l~p)^-^ q^Np{l-p)^-^ 
2q{l ~ q)Np{l - p)^-^ 1 ^ q^Np{l ~ p)^-^ 

The eigenvalues of this matrix are 1 and 

R=l- q^Np{l - p)^-^ - 2q{l - q)Np{l - p)^''^ 
^l-q{2-q)Np{l-p)'^-\ 

and the eigenspace relative to eigenvalue 1 is spanned by 



iNpH-pY , 2(1-,) 
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Since E[pp*] belongs to this eigenspace, and l*E[pp*]l = 1, we conclude that 

B = E\pp*] - N-Hl* = -^—-(I - —). 

U 
Some remarks are in order about the role of the parameters p, q in the CBGA algorithm on the 
complete graph. 

Remark 5.1 (Optimization) The convergence rate i? as a function of p is optimal for p* = 1/A^. The 
optimal value R{p*) = 1 — 9(2 — q){l — jq)^^^ is increasing in N and tends to 1 — q{2 — q)^ when N 
goes to infinity. Instead, if wc fix p = p, R{p) tends to 1 as A^ — > 00. On the other hand, B is the same 
as for the BGA in Example 14.31 and is independent of p. 

Prom the design point of view, it is clear that p has to be chosen equal to N~^, optimizing the 
speed. Instead, choosing q we trade off speed and asymptotic displacement: this trade-off is numerically 
investigated in Figure [5j 




Figure 5: Trade-off in the CBGA between tr(i?) and R as functions of q, for a complete graph with 
AT = 30 and p == l/N. 



5.2 Abelian Cay ley graphs 

In this subsection we consider the CBGA on Abelian Cayley graphs with bounded degree. The next 
result characterizes the mean square analysis of the algorithm. 

Lemma 5.3 Consider the CBGA algorithm and let the communication graph Q be Abelian Cayley with 
degree d. Then, the MSA vector n evolves as 



7r(t + l) = (C + T)7r(i), 



(24) 



where the matrix 



C^{I- qp{l - pfL*){I - qp{l - p)''L) 



is Cayley and T is a matrix whose entries do not depend on N and such that the number of non-zero 
rows is at most d^{d -\- 1)^ and non-zero columns is at most d^ . 

Proof: Using commutativity of Abelian Cayley matrices and (|2ip , wc obtain that 

A(t + !) = (/- qp{l - p)\L + L*))A{t) + q^E[L(ty A{t)L{t)] (25) 
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Passing to the generating vector, 

nit + !) = (/- qp{l - pf{L + L*)Mt) + q^finit) (26) 

where 

f{n{t)) := E[L{ty A{t)L{t)]e° = E[L{t)]* A{t)E[L{t)]e° +E[L(ty A{t)L{t)]e° - E[L{t)]* A{t)E[L{t)]e° 



Now, 
and 



h{7T{t))=p\l-pf''L*L7T{t) (27) 

[hiHt))]i - EhkML{t)himko]-E[L{t)hi]E[L{t)ko]7rit)h-k 

(28) 

= Etk mmk+umko] - E[L{t)k+tML{t)koMt)t 

Notice now that, by the way the model has been defined, we have that Aij{t) and A^kit) are independent 
whenever N~{i) n N~{h) = or equivalently i — h ^ S ~ S. In this case, also Lijit) and Lhk{t) are 
independent. Therefore, the double summation in (pS)) . can be restricted to fc G S* U {0} and t <E S — S. 
Consequently, the values of I for which [/2(7r(i))]i ^ can be restricted to (S U {0}) + S - S - (S U {0}). 
Plugging ((27| inside (|26)) and using the information on the structure of /2(7r(t)) obtained above, the 
result follows. 

n 

Thanks to Lemma [5.31 we can argue a result analogous to Theorem 14.51 

Theorem 5.4 (Unbiasedness of CBGA) Fix a finite S" C Z'' \ {0} generating 11^ as a group. For 
every integer n, let Vn = [—n,n]'^ considered as the Abelian group '^2n+i ^^'^ ^^^ ^ ^^ the Cayley 
Ahelian graph generated by Sn ^ S O [—n,n]'^. On the sequence of Q^"^' the CBGA is asymptotically 
unbiased. 

Proof: The idea is to apply the perturbation result Theorem 13. 5t o the sequence of matrices C* and 
(C + T)*. Notice that Qc- = Qa+A'+a-a 2 G^"^ while g(^c+Tr ^ ^^"^ by LemmaEH Hence, Gc* and 
G{c+T}* are both strongly connected. As in the proof of Theorem 14.51 this also implies that the limit 
graph on Z^ of the two sequences Gc* and t/^p^^j,-,, are also strongly connected. Finally, notice that C* 
is Cayley Abelian, hence obviously weakly democratic, while Lemma [4.41 guarantees that {C + T)* is a 
finite perturbation of C* in the sense of Section [3731 Hence also {C + T)* is weakly democratic. This 
yields 

trB= \N-^ -nl^l <N-'^ +TT'a ^ 0. 

D 
The form of T depends on the graph, and is complicated to compute in general. To give an example, 
in the next subsection we shall compute T for the ring graph. 

5.3 Ring graphs 

In this subsection, we specialize the results about about Cayley graphs to the case of ring graphs. In 
particular, we obtain tight bounds on its convergence rate. The following lemma can be proved on the 
lines of Lemma 15.31 bv a direct computation, which we omit. 

Lemma 5.5 Let G be a ring graph on N > 9 nodes. Let C be the doubly stochastic symmetric circulant 
matrix C = circ([l — 2ki — 2/c2, fci, ^2, 0, . . . , 0, fc2, fci]) with 



k^^2qp{\~p)\\~2qp{l~p) 

fc2 = gV(l-p)^ 
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and let 



/4-6p(l-p)2 4p(l-p)2 {1-pf 

4p(l-p)2-2 (l-6(l-p)2)p 2(2p-l)(l-p)2 

-p(l-p)2 4p(l-p)2-2|) (1 -p)(l-6p(l-p)) 



(l-(l-p)2)p -2p(l-p)(2p-l) 
P^{l-P) 









-p{i~p? 

V4p(l-p)2-2 -Pil-p)^ 



{1-pr 






p^l-p) 



M^-p? \ 
-p{i-pf 










-2p{l-p){2p-l) {l^{l-pf)p 
(l-p)(l-6p(l-p)) 4p(l-p)2-2p 
2{l^pf{2p-l) {l-Q[l-pf)p j 



Then, the MSA vector evolves as n{t + 1) = (C + T)'K(t). 



Numerical computations based on Lemma [5.51 show that tr(i3) = Q{N ^), that is the asymptotical 
error has the same dependence on N as for the BGA. 

Next result is about the rate of convergence, and shows that the bound on the convergence rate based 
on P is tight for the ring graph. 

Proposition 5.6 (Ring graph - Rate) Given a ring graph and CBGA algorithm, we have 

P = I-qp{l-pfL- 
L{a) ^n- 2q{l - q)p{l - pf L - q^p{l -pfN-'^L^ + q^p{l - pf N~'^pcuc{t), 

where 

T= [2(p-2),6-4p + /,-3(2-2p + /),2-4p + 3p^0,...,0,2-4p + 3/,-3(2-2p + /),6-4p + /]. 

Namely, 



Stt 



Stt 



1 - qp{l - pfj;^ <R<l-qil- q)p{l - pfj^ 



for N large enough. 

Proof: Computing P is easy. In order to compute Ij(J7), we first compute, using Lemma 15.51 

L(/)=circ((C + r)(l,0,...,0)*) 

= circ(l - 2fci - 2fc2 + (7'p(l -P)'(4 - 6p(l -p)'),^! + q^p{l - pf{Ap{l - pf - 2), 
fc2 - g'p(l - P)'p(l - p)2, 0, . . . , 0, fc2 - <7'p(l - P)'p(l - pf, 

h+qM^-pfiM^-pf-2)) 
= circ(l - 4g(l - q)p{l - pf,2p{l - pfqil - g), 0, . . . , 0, 2p(l - pfq{l - q)) 
= ^2q{\-q)p{l-pfL, 

and 

L(ll*) = circ((C + r)l) = 11* + circ(Tl). 

Then, by straightforward computations, 

L(f7) =0 - 2g(l - q)p{l - pfh - circ(ri) 

=VL - 2q{l - q)p{l - pf L - q^p{l - pfN-^L^ + gV(l - pfN"^ circ(T), 
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provided we define r as in the statement. 

Note that r*l = and that the eigenvalues of the circulant matrix circ(r) are 

} TO + 2 Y,n cos (^iM 

and those of L are {2 (l — cos {^l)) },py ■ As a consequence, the eigenvalues of L(51) are /io = 0, and 
fii =1 + 2p{l - p)^[-2q{l -q)- ^q'^N'^ + Toq^pN-^] 

+ 2p{l - pf[2q{l -q) + 4q^N-^ + Tiq^pN"^] COS ('^/'j + 2q^p{l - pfN-^[T2P - 1] COS (^l 



+ 2T^q^p^ {I ^pfN"^ COS (^l\ +2Tiq^p^{\-pfN-^C0s{ ^l 



(Stt 



for i = 1, . . . , A^ — 1. Because of the N ^ factors, we have that, as N ^ oo, 

,27r2/2 / p 



^ll = l- 4<z(l - q)p{\ - pf^ + " V 7V2 



Then, for N large enough. 



1 



esrL(rj) = _max^^^/ = l-8g(l-g)p(l -p)^— 2 ■ - i ,.2 



Finally, the bound on the rate follows from ([S|). D 

Remark 5.2 The speed of convergence for the Collision Broadcasting Gossip is one order faster than 
the Broadcasting Gossip. This is not surprising, since in the former case the average number of activated 
nodes per round is Np, instead of 1. 

Remark 5.3 (Rate Optimization) Remarkably, for N large enough both the upper and the lower 
bound on the rate show the same dependence on p. Thus, they can be simultaneously optimized by 
taking p* = 1/3. The behavior for large N can also be investigated by numerical computations, showing 
that esr(C) < esr(C + T), and esr(C + T) = 1 - Q{N^'^). This means that the perturbation T does not 
significantly affect the rate for large TV. Moreover, since esr(C) = 1 — Q{N^'^) and csr(C + T) — esr(C) = 
1 — <d{N~^), we argue that actually 

This is reminiscent of a similar observation about the BGA algorithm in Remark [ 

We can also numerically evaluate B and R as functions of p and q: these results are shown in Figure[S] 
Note that the asymptotical error does not depend significantly on p: this implies, from the design point 
of view, that p can be chosen equal to p* = 1/3, optimizing the convergence rate. Instead, choosing q we 
can trade off asymptotical error and convergence rate. 



6 Conclusion 

This paper has been devoted to study gossip algorithm for the estimation of averages, based on iterated 
broadcasting of current estimates, focusing on their capability of providing an unbiased estimation. In 
this framework, we presented a novel broadcast gossip algorithm, dealing with interference in communica- 
tions, whose impact is studied in the paper. Our results, obtained under specific symmetry assumptions 
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Figure 6: Dependence in the CBGA of tr {B) and R for the ring graph. Left plot assumes N = 30, 
p = 1/3, right plot assumes N = 30, q = 1/2. 

about the network topology, allow to conjecture an interesting picture of the performance of broadcast 
gossip algorithms on real world networks, in terms of achievable precision and robustness to interference. 
Interferences have a negative effect on the rate of convergence, which can be minimized by a suitable 
choice of the broadcasting probability p. As known for static consensus algorithms based on diffusion, 
the rate degrades on large locally connected graphs. Instead, interferences have minimal effect on the 
asymptotical error, which depends on the network topology: on large highly connected graphs, the algo- 
rithms provide biased estimations, whereas on locally connected graphs the estimation bias goes to zero 
as the network grows larger. 

A better understanding of the role of the network topology in the trade-off between speed and achiev- 
able precision might come from the following extensions of our work. In this paper two simultaneous 
technical assumptions have been made: the graphs are Abelian Cayley, thus in particular vertex-transitive, 
and each node broadcasts with the same probability. Future work should consider non-vertex-transitive 
networks of nodes with non-uniform broadcasting probabilities. Furthermore, it seems interesting to 
study the performance of the algorithms on families of expander graphs and small- world networks, which 
ensure a good scaling of the rate of convergence, while keeping the degree of nodes bounded. 

Acknowledgements: The authors wish to thank Sandro Zampieri for many fruitful discussions on the 
issues studied in this paper. 
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